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Abstract 

We present a class of linear programming approximations for mixed-integer polynomial op¬ 
timization problems that take advantage of structured sparsity of the constraint matrix. In 
particular, we show that if the intersection graph of the constraints has tree-width bounded by 
a constant, then for any desired tolerance there is a linear programming formulation of polyno¬ 
mial size. Via an additional reduction, we obtain a polynomial-time approximation scheme for 
the “AC-OPF” problem on graphs with bounded tree-width. These constructions partly rely on 
a general construction for pure binary optimization problems where individual constraints are 
available through a membership oracle; if the intersection graph for the constraints has bounded 
tree-width our construction is of linear size and exact. This improves on a number of results in 
the literature, both from the perspective of formulation size and generality. 


1 Introduction 

A fundamental paradigm in the solution of integer programming and combinatorial optimization 
problems is the use of extended, or lifted, formulations, which rely on the binary nature of the 
variables and on the structure of the constraints to generate higher-dimensional convex relaxations 
with provably strong attributes. In this paper we consider mixed-integer polynomial optimization 
problems. We develop a reformulation operator which relies on the combinatorial structure of the 
constraints to produce linear programming approximations which attain provable bounds. A major 
focus is on polynomial optimization problems over networks and our main result in this context 
(Theorem [7] below) implies as a corollary that there exist polynomial-size linear programs that 
approximate the AC-OPF problem and the fixed-charge network flow problem on bounded tree- 
width graphs. 

Our work relies on the concepts of intersection graph and tree-width ; as has been observed before 
lfl2j. 39], [35], [56], [54]). the combination of these two concepts makes it possible to define a 
notion of structured sparsity in an optimization context that we will exploit here (see below for 
more references). The intersection graph of a system of constraints is a central concept originally 
introduced in (24j and which has been used by many authors, sometimes using different terminology. 

Definition 1 The intersection graph of a system of constraints is the undirected graph which 
has a vertex for each variable and an edge for each pair of variables that appear in any common 
constraint. 

Example 2 Consider the system of constraints on variables X\,X 2 ,Xs and X 4 . 

(1) 3xf - log 2 (l + x 2 ) > 0, (2) -2xl + (I+Z 3) 3 > 0, (3) ^ 3 X 5 = 0 m 

( 5)^3 — 2 — xl < 0, (5) (ar, X 4 ) € A 

Where A is some arbitrary set. Then the intersection graph has vertices {1,2,3,4, 5} and edges 
{1,2}, {2,3}, {3,4}, {3,5} 

Definition 3 An undirected graph has tree-width < k if it is contained in a chordal graph with 
clique number < k + 1 . 
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The tree-width concept was explicitly defined in [?8] (also see [J9]), but there are many equivalent 
definitions. An earlier discussion is found in [3UJ and closely related concepts have been used by 
many authors under other names, e.g. the “running intersection” property, and the notion of “partial 
k-trees”. An important known fact is that an n-vertex graph with tree-width < k has 0(k 2 n) edges, 
and thus low tree-width graphs are sparse, although the converse is not true. In the context of this 
paper, we can exploit structural sparsity in an optimization problem when the tree-width of the 
intersection graph is small. We also note that bounded tree-width can be recognized in linear time 

m- 

Our focus, throughout, is on obtaining polynomial-size LP formulations. The construction of 
such “compact” formulations is a goal of fundamental theoretical importance and quite separate 
from the development of polynomial-time algorithms. From a numerical perspective, additionally, 
the representation of a problem as an LP permits the use of practical bounding techniques such as 
cutting-plane and column-generation algorithms. We first prove: 

Theorem 4 Consider a mixed-integer, linear objective, polynomially constrained problem 

(PO): min c T x (2a) 

subject to: fi{%) >0 1 < i < m (2b) 

Xj G {0,1} 1 < j < p, Xj €[0,1] p + 1 < j < n. (2c) 

Let 7T and F be such that for 0 < i < m, fi(x) has maximum degree at most n and Li-norm of 
coefficients at most F. If the intersection graph of the constraints has tree-width < w then for any 
0 < e < 1, there is a linear programming formulation with O ((27r/e)“ +1 nlog(7r/e)) variables and 
constraints that solves V within feasibility tolerance Fe and optimality tolerance ||c||ie. 

Below we will provide an extended statement for this result, as well as a precise definition of 
‘tolerance’. However, the statement in Theorem[4]is indicative of the fact that as e -> 0 we converge 
to an optimal solution, and the computational workload grows proportional to 0 (e~ ul ~ 1 loge _1 ). 
Moreover, as we will argue, it is straightforward to prove that unless P = NP , no polynomial time 
algorithm for mixed-integer polynomial optimization exists that improves on the dependence on 
e given by Theorem [4] As far as we know this theorem is the first to provide a polynomial-size 
formulation for polynomial optimization problems with guaranteed bounds. 

Our next result is motivated by recent work on the AC-OPF (Optimal Power Flow) problem in 
electrical transmission m, m, m- a generic version of this problem can be succinctly described 
as follows. We are given an undirected graph G where for each vertex u G V(G) we have two 
variables, e u and f u . Further, for each edge {u,v} we have four 2x2 matrices M uv , M vu , N uv and 
N vu . For each edge h = {it, r>}, write Wh = (e„, f u , e v , f v ) T . Then we have 


(AC-OPF): 

min 

£ Cu 9 2 n 


(3a) 



u£V 



subject to: 

Lu 

< £_ 

w T h M uv w h < U u Vu G V(G) 

(3b) 



h={u,v}^S(u 

0 



L'u 

< £ 

w%N uv w h < U’ u VmgF(G) 

(3c) 



h={u,v}(z5(u 

0 



B u 

< ll( e M, fu) T \ 

< A u \/u G V(G) 

(3d) 


9u 

= £ 

WfrM uv w h Vw G V(G). 

(3e) 


h—{u,v}^S(u) 


In this formulation L u , L' U ,U U ,U' U , B u , A u and c u are given values, and the g u are auxiliary variables 
defined as per (3e|. Here and below, given a graph H its vertex set is V{H) and its edge set is 
E(H), and for u € V(H) we use S(u) = Sh(u) to denote the set of edges incident with u and write 
deg(w) = deg ff (w) = \S(u)\. 

As a generalization of AC-OPF, we consider network mixed-integer polynomial optimization prob¬ 
lems (NPOs, for short). These are PO problems with an underlying network structure specified by 
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a graph G. Specifically, we assume for each vertex v in G there is a set X v of variables associated 
with v. Moreover, each constraint is associated with one vertex of G; a constraint associated with 
vertex u takes the form 


Y PuA X uGX v ) > 0, (4) 

{u,v}£5(u) 

where S(u) is as defined above and each p u , v is a polynomial. Note that this definition allows a vertex 
v to have many constraints of the type © associated with it. The sets X v are not assumed to be 
pairwise disjoint, and thus a given variable may appear in several such sets; however, for technical 
reasons, we assume that for any variable Xj the set {v £ V(G) : Xj £ X v } induces a connected 
subgraph of G. Clearly AC-OPF is an NPO (with the X v pairwise disjoint), and it can also be 
shown that optimization problems on gas networks 32j are NPOs, as well. 

Yet another example is provided by the classical capacitated fixed-charge network flow problem 
(see m) which has received wide attention in the mixed-integer programming literature. In the 
simplest case we have a directed graph G; for each vertex u we have a value b u and for each arc 
(it, v) we have values f uv , c uv and w uv . The problem is 

(FCNF): min E fuvDuv T c uv x uv (5a) 

(u,v) 

subject to: Nx = b, (5b) 

0 x uv ft w uv y uv , y u , v £ {0? 1} V(it,i:). (5c) 

where N is the node-arc incidence matrix of G. This is an NPO (e.g. associate each x u>v with either 
it or v). When G is a caterpillar (a path with pendant edges) it includes the knapsack problem as 
a special case. FCNF can arise in supply-chain applications, where G will be quite sparse and often 
tree-like. 

Above (Theorem [4]) we have focused on exploiting the structure of the intersection graph for a 
problem; as we discussed this graph is obtained from a formulation for the problem. However, in 
NPOs there is already a graph, which in the above examples frequently has moderate tree-width, 
and it is this condition that we would like to exploit. In fact, recent work ([44], m) develops faster 
solutions to SDP relaxations of AC-OPF problems by leveraging small tree-width of the underlying 
graph. To highlight the difference between the two graphs, consider the following examples: 

Example 5 Consider the NPO with constraints 

xl+x^ + Zxl < 1, x\ — x\ + Xi > 0, 2 : 32:4 + x\ — Xq > 1/2 

0 < 2q < 1 , 0 < X2 < 1 , 0 < 2:3 < 1 , 0 < 2:5 < 1 , 0 < Xq < 1, 2:4 £ {0, 1 }. 

Here, variables x\ and X 2 are associated with vertex a of the graph in Figure [7J and variables X 3 , 
2:4, X 5 and Xq are associated with vertices b,c,d and e, respectively. The first constraint involves 
variables associated with the endpoints of edge {a, b}, the second concerns edges {a, 5} and {a, c}, 
and the third concerns edges {b, c},{b, e} and {b, d}. 



Figure 1: (Example [5| (a) Network polynomial optimization problem, (b) The intersection graph. 
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Example 6 Consider a knapsack problem min{ c T x : a T x > b, x £ {0,1}"}. This is a NPO , 
using a star network on n+1 vertices, which has tree-width 1. Yet, ifaj ^ 0 for all j, the intersection 
graph is a clique of size n. Note that we can restate x £ {0,1}" as xfl 1 — xf) = 0, \/j. 


In fact, even an AC-OPF instance, on a tree, can give rise to an intersection graph with high tree- 
width, because constraint (3b) or (3c) mirrors the intersection graph behavior in Example [hj 


In the general case, in order to avoid the possible increase in tree-width going from the underlying 
graph to the intersection graph, we show that given a NPO problem on a graph of small tree-width, 
there is an equivalent NPO problem whose intersection graph also has small tree-width. As a result 
of this elaboration we obtain: 


Theorem 7 Consider a network mixed-integer polynomial optimization problem over a graph G 
of tree-width < uj and maximum-degree D, over n variables, and where every polynomial p U}V has 
maximum degree < 7r. Suppose that the number of variables plus the number of constraints associated 
with any vertex of G is at most A. Given 0 < e < 1, there is a linear programming formulation of 
size 0((.D7r/e) 0 ( A “) n log(7r/e)) that solves the problem within scaled tolerance e. 

Here, “scaled tolerance” embodies the same notion of optimality and feasibility approximation as in 
Theorem [4] Later we will discuss why this approximation feature is needed. Further, we note that 
in the case of the AC-OPF problem we have A = 5 and 7r = 2. 

Corollary 8 There exist polynomial-time approximation schemes for the AC-OPF problem on graphs 
with bounded tree-width, and for the capacitated fixed-charge network flow problem on graphs with 
bounded tree-width. 

Our third result is important toward the proof of Theorem [4] but is of independent interest. As 
we will see, the construction in Theorem [4] approximates a mixed-integer polynomially constrained 
problem with a polynomially constrained pure binary polynomially constrained problem. As a 
generalization, we study “general” binary problems, or GB for short, defined as follows. 

(i) There are n variables and m constraints. For 1 < * < to, constraint i is characterized by a 
subset K[i] C {1,.. . ,n} and a set S l C {0, 1} K 'W. Set S l is implicitly given by a membership 
oracle, that is to say a mechanism that upon input y £ {0,1}*"M, truthfully reports whether 
y&S\ 

(ii) The problem is to minimize a linear function c T x, over x £ {0,1}", and subject to the con¬ 
straint that for 1 < i < to the sub-vector x^u] is contained in S l . 

Any linear-objective, binary optimization problem whose constraints are explicitly stated can be 
recast in the form GB; e.g., each set S l could be described by a system of algebraic equations in 
the variables Xj for j £ K[i\. GB problems are related to classical constraint satisfaction problems, 
however the terminology above will prove useful later. A proof of part (a) in the following result can 
be obtained using techniques in |39] (Section 8) although not explicitly stated there. We will outline 
this proof, which relies on the “cone of set-functions” approach of [42] and also present a new proof. 

Theorem 9 Consider a GB problem whose intersection graph has tree-width < to. 

(a) There is an exact linear programming formulation with 0[2 u n) variables and constraints, with 
{ 0 , 1 , — 1 }-valued constraint coefficients. 

(b) The formulation can be constructed by performing 2 u, m oracle queries and with additional 

workload 0 *(uj 2“ J2t£V(T) I {* ■ ^ Qt} I + aimn) = O*(co 2 u mn + u 2 2 u n), where the 

notation indicates logarithmic factors in m or n. 
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Note that the size of the formulation is independent of the number constraints in the given in¬ 
stance of GB. And even though we use the general setting of membership oracles, this theorem gives 
an exact reformulation, as opposed to Theorems [4] and [7J where an approximation is required unless 
P=NP. Theorem [9] has additional implications toward linear and polynomial binary optimization 
problems. We will examine these issues in Section |4j Regarding part (b) of the theorem, it can be 
shown that 2 w m is a lower bound on the number of oracle queries that any algorithm for solving 
GB must perform. 

Theorem [9] describes an LP formulation for GB; the relevance of this focus was discussed above. 
Together with the reductions used to obtain Theorems [4] and [7] we obtain approximate LP formu¬ 
lations for polynomial (resp., network polynomial) mixed-integer problems. Of course, Theorem 
[9] also implies the existence of an algorithm for solving GB in time polynomial in 0( 2“’(n + m)). 
However one can also derive a direct algorithm of similar complexity using well-known, prior ideas 
on polynomial-time methods for combinatorial problems on graphs of bounded tree-width. 

1.0.1 Prior work 

There is a broad literature dating from the 1980s on polynomial-time algorithms for combinatorial 
problems on graphs with bounded tree-width. An early reference is |3J. Also see 0, 0, HBJ, 0, 
m m and from a very general perspective, [IB] . These algorithms rely on “nonserial dynamic 
programming”, i.e., dynamic-programming on trees. See n, m, 0- 

A parallel research stream concerns “constraint satisfaction problems”, or CSPs. Effectively, the 
feasibility version of problem GB is a CSP. One can also obtain an algorithm for problem GB, with 
similar complexity, and relying on similar dynamic programming ideas as the algorithms above, from 
the perspective of belief propagation on an appropriately defined graphical model. Another central 
technique is the tree-j unction theorem of [401 , which shows how a a set of marginal probability 
distributions on the edges of a hypertree can be extended to a joint distribution over the entire 
vertex set. Early references are m, m and [22] ■ Also see [53], [T2] (and references therein), and 

SSI- 

Turning to the integer programming context, [13] (also see the PhD thesis [57]) develop extended 
formulations for binary linear programs by considering the subset algebra of feasible solutions for 
individual constraints or small groups of constraints; this entails a refinement of the cone of set- 
functions approach of [32]. The method in m is similar to the one in this paper, in that here 
we rely on a similar algebra and on extended, or “lifted” reformulations for 0/1 integer programs. 
The classical examples in this vein are the reformulation-linearization technique of m, the cones of 
matrices method [42] , the lift-and-project method of [6], and the moment relaxation methodology 
of [37] . See [38] for a unifying analysis; another comparison is provided in [5j. 

The work in m considers packing binary integer programs are considered, i.e. problems of the 
form 


max{c T ir : Ax < b, x £ {0,1}"} (6) 

where A > 0 and integral and b is integral. Given a valid inequality ax > /3 , its associated graph is 
the subgraph of the intersection graph induced by S = {1 < j < n : ay 7^ 0}; i.e. it has vertex-set 
S and there an edge k} £ S x S whenever a,j 7^ 0 and 7^ 0 for some row i. 

In [12] it is shown that given and ui > 1, the level-w Sherali-Adams reformulation of ([6]) implies 
every valid inequality whose associated graph has tree-width < u — 1. Further, if A is 0/1-valued, 
the same property holds when the associated graph has tree-width < oj. As a corollary, given a 
graph G with tree-width < w, the Sherali-Adams reformulation of the vertex packing linear program 
{x £ [0,1]' : x u + x v < 1 V{u, u} £ E(G)}, which has O(con u+2 ) variables and constraints, is 

exact. As far as we know this is the first result linking tree-width and reformulations for integer 
programs. A different result which nevertheless appears related is obtained in [20] . 

In [541 . binary polynomial optimization problems are considered, i.e problems as min{ c T x : x £ 
{0,1}”, gi(x) >0, 1 < * < M} where each gi(x) is a polynomial. They show that if the tree- 
width of the intersection graph of the constraints is < w, then the level-w Sherali-Adams or Lasserre 
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reformulation of the problem is exact. Hence there is an LP formulation with 0(n u>+2 ) variables 
and 0(n u+2 M) constraints. 

A comprehensive survey of results on polynomial optimization and related topics is provided in 
[55] . Section 8 of [35] builds on the work in (33], which provides a common framework for the Sherali- 
Adams, Lovasz-Schrijver and Lasserre reformulation operators. In addition to the aforementioned 
results related to problem GB (to which we will return later), [35] explicitly shows that the special 
case of the vertex-packing problem on a graph with n vertices and tree-width < oj has a formulation 
of size 0{2 u n)\ this is stronger than the implication from [12] discussed above. Similarly, it is shown 
in [39] that the max-cut problem on a graph with n vertices and tree-width < oj has a formulation 
of size 0 {2 ul n). 

In the continuous variable polynomial optimization setting, [33], and [53 ] present methods for 
exploiting low tree-width of the intersection matrix e.g. to speed-up the sum-of-squares or moment 
relaxations of a problem. Also see [28] and Section 8 of [39]. [35] considers polynomial optimization 
problems as well. In abbreviated form, [35] shows that where p is the tree-width of the intersection 

graph, there is a hierarchy of semidefinite relaxations where the r th relaxation (r = 1,2,.) has 

0(np 2r ) variables and 0(n + m) LMI constraints; further, as r —> +oo the value of the relaxation 
converges to the optimum. Also see [33] and [33]. 

Finally, there are a number of results on using lifted formulations for polynomial optimization 
problems, along the lines of the RLT methodology of [50]. See [52], [51] and references therein. 


1.0.2 Organization of the paper 


This paper is organized as follows. Mixed-integer polynomial optimization problems and a proof of 
Theorem 4 are covered in Section [2] Network mixed-integer polynomial optimization problems and 
Theorem 7 are addressed in Section [3] And finally, in Section [4] we will present a detailed analysis 
of the pure binary problems addressed by Theorem [9] and a proof of this result. 


2 Mixed-integer polynomial optimization problems 

In this section we consider mixed-integer polynomial optimization problems (PO) and prove a result 
(Theorem [l5j below) that directly implies Theorem [4] given in the introduction. This proof will 
make use of the result in Theorem [3] to be proven in Section [4] 

In what follows we will rely on a definition of tree-width which is equivalent to Definition [3] This 
definition makes use of the concept of tree-decomposition: 

Definition 10 Let G be an undirected graph. A tree-decomposition pffl , \49\j of G is a pair 
( T,Q) where T is a tree and Q = {Qt : t £ V(T)} is a family of subsets ofV(G) such that 

(i) For all v £ V(G), the set {t £ V(T) : v £ Qt] forms a subtree T v of T, and 

(ii) For each {u,'r} £ E(G) there is a t £ V(T) such that {w,r'} C Q t , i.e. t £ T u D T v . 

The width of the decomposition is max{|Q t | : t £ V(T)} — 1. The tree-width of G is the minimum 
width of a tree-decomposition ofG. See Example\l 1\ 

Example 11 (Tree-decomposition) Consider the intersection graph G arising in Example^ See 
Figure^a). A tree-decomposition with tree T is shown in Figure^b)-(c). 

Since this definition relates a specific decomposition of the graph with its tree-width, many of 
the arguments we provide will rely on modifying or creating valid tree-decompositions that attain 
the desired widths. 

We make some remarks pertaining to to the PO problem. Throughout we will use the definition 
of used in the introduction (formulation |2|). The i th constraint, for 1 < i < to, is given by fi(x) > 0, 
where fi{x) is a has the form 

fi{x) = h<*x a - (3) 

ctGl(i) 
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Figure 2: A tree-decomposition, (a) Graph G. (b) Tree T. (c) Tree T with subtrees T v . 


Here I[i ) C {1, ..., n} is a finite set, /i iQ is rational and x a is a monomial in x : 

n 

= n*?- 

3 =1 


Finally, for each 1 < i < m we will denote as ||/;||i the 1-norm of the coefficients of polynomial 

/»(*), i-e 

\\fih= E IA«I- 

aEl(i) 

Any linear-objective mixed-integer polynomial optimization problem where the feasible region is 
compact can be reduced to the form § by appropriately translating and scaling variables. 

Remark 12 A polynomial-optimization problem with nonlinear objective can trivially be made into 
the form by (for example) using a new variable and two constraints to represent each monomial 
in the objective. Of course, such a modification may increase the tree-width of the intersection graph. 


Now we precisely define what the intersection graph would be in this context. 


Definition 13 Given an instance of problem PO, let its intersection graph T be the undirected graph 
with n vertices and where for 1 < i < m the set 

{j : oij ^ 0, f i>a ± 0, a e I(i)j 


induces a clique. 

Definition 14 Consider an instance of problem PO. 

(a) Given e > 0, we say a vector x £ {0,1} P x [0, l\ n ~ p is scaled-e feasible if 

fi(x) > —e||/j||i, 1 < i < m. (8) 

(b) We set tt A maiKK™ max a6/(i) E"= P +i a j- 
We will prove the following result: 

Theorem 15 Given an instance of PO, let to be the width of a tree-decomposition of the intersection 
graph. For every 0 < e < 1 there is a linear program 

LP : min{c T ?/ : Ay > b} 


with the following properties: 

(a) The number of variables and constraints is O ((27r/e) CJ+1 nlog(7r/e)) ; and all coefficients are of 
polynomial size. 

(b) Given any feasible solution x to PO, there is a feasible solution y to LP with 

c T y < c T x + e||c||i. 
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(c) Given an optimal solution y* to LP, we can construct x* £ {0,1} P x [0, l] n p such that: 


1. x* is scaled-e feasible for PO, and 

2 . c x = c y . 

Remark 16 Assume PO is feasible. Then (b) shows LP is feasible and furthermore by (c) solving 
LP yields a near-feasible solution for PO which may be superoptimal, but is not highly suboptimal. 
Condition (a) states that the formulation LP is of pseudo-polynomial size. 


To prove Theorem 15 we will rely on a technique used in (26] ; also see m and [2Tj, [29] and 
citations therein. Suppose that 0 < r < 1. Then we can approximate r as a sum of inverse powers 
of 2. Let 0 < 7 < 1 and 

L = L(j) = [log 2 7 ~ 1_ | ■ 

Then there exist 0/1-values Zh, 1 <h < L, with 


^ Z 2 ~ kz h < r < J2 2 ~ hzh + 2 ~ L < ^Z 2 ~ hz h + 7 < I- 


(9) 


h=l 


h =1 


h =1 


Next we approximate problem PO with a problem of type GB. For each 1 < i < m and a G I(i) we 
write 

Z(i, a) = {j : otj £ 0 , 1 <j< p}. 

In other words, set Z(i, a) is the set given by the indices of the binary variables for PO that appear 
explicitly in monomial x a ; thus for j £ Z(i,a) we have x^ J = Xj. Let 

6 - 6( 7 ,tt) = l-(l- 7 ) 7r . ( 10 ) 

The theorem will be obtained by using the following formulation, for appropriate 7 = 7 (e) : 


(GB( 7 )) : 

V 

min 

n / L 

x :i + 2 kz :i’ h 


i=i 

J—p +1 \h=l 



n f L 

s.t. 

fLa 

n n i^ 2 2 hz 


a£/(i) 

jeZ(i,a) j=P+ 1 \/i=l 


zph | > -£||/»||i, 1 < * < m 

Xj £ {0,1}, 1 < j < p, z J:h £ {0,1}, Vjf £ {p + 1,..., n}, 1 < h < L. 

Remark. This formulation replaces, in PO, each continuous variable Xj with a sum of powers of 
1/2, using the binary variables Zj t h in order to effect the approximation ([ 9 ]). 


To prove the desired result we first need a technical property. 

Lemma 17 Suppose that for 1 < k < r we have values Uk >0, Vk > 0, qu £ Z + with Uk+Vk < 1. 
Then 

r r r 

IpWfc+Vfc)‘ Z ' i - Yl U k k ~ 1 P(f - V k ) qk . 

k =1 k =1 k=1 

Proof. Take any fixed index 1 < i < r. The expression 

r r 

JpWfc +V k ) qk - Iptt/c) 9 ' 1 
fc=l fc=l 

is a nondecreasing function of m when all Uk and Vk are nonnegative, and so in the range 0 < iq < 
1 — Vi it is maximized when Ui = 1 — v t . M 


Using this fact, we can now show: 





Lemma 18 


(a) Suppose x is a feasible for PO. Then there is feasible solution for GB("/) with objective value 
at most c T x + <5||c||i. 


(b) Conversely, suppose (x,z) is feasible for GB ( 7 ). Writing, for each p + 1 < j < n, Xj = 
2~ h Zj,h, we have that x is scaled-S-feasible for PO and c T x = c i (j2h =1 2~ h Zj,h^J ■ 

Proof, (a) For each j choose binary values z h h so as to attain the approximation in ([ 9 ]). Then for 
each 1 < i <m and a G I{i) we have 


II E 2 hz i’ h 


< 


n 


< n E 2 

j=p+1 \h=l 


— h 


z .i,h 


s. 


j=P+ 1 \h=l / j=P+ 1 

Here the left-hand inequality is clear, and the right-hand inequality follows from Lemma [T7| and the 


definition (10) of 6 . Thus z is feasible for GB( 7 ) and the second assertion is similarly proved. 

(b) Follows by construction. ■ 

We can now complete the proof of Theorem [l5j Given an instance of problem PO together 
with a tree-decomposition of its intersection graph, of width w, we consider formulation GB( 7 ) for 
7 = £ 7 r -1 . As an instance of GB, the formulation has at most nL( 7 ) variables and its intersection 
graph has width at most (u + 1 )^( 7 ) — 1. To see this point, consider a tree-decomposition ( T,Q ) 
of the intersection graph for PO. Then we obtain a tree-decomposition (T, Q) for GB( 7 ) by setting, 
for each t G V(T), 

Qt = {Zj,h : 1 <h< L( 7 ), j G Qt}. 

We then apply, to this instance of GB, Theorem [9] We obtain an exact , continuous linear 
programming reformulation for GB( 7 ) with 

0( 2 ("+i) L WnL(7)) = 0((27r/e) aj+1 nlog(7r/e)) 

variables and constraints. In view of Lemma [l 8 j and the fact that (5=1 — (1 — 7 )'"' < n'y = e, the 
proof of Theorem |To] is complete. 


2.0.3 Can the dependence on e be improved upon? 

A reader may wonder why or if “exact” feasibility (or optimality) for PO cannot be guaranteed. 
From a trivial perspective, we point out that there exist simple instances of PO (in fact convex, 
quadratically constrained problems) where all feasible solutions have irrational coordinates. Should 
that be the case, if any algorithm outputs an explicit numerical solution in finite time, such a solution 
will be infeasible. A different perspective is that discussed in Example [ 6 j As shown there we cannot 
expect to obtain an exact optimal solution in polynomial time, even in the bounded tree-width case, 
and even if there is a rational optimal solution, unless P = NP. 

To address either issue one can, instead, attempt to output solutions that are approximately 
feasible. The approximation scheme given by Theorem 15 has two characteristics: first, it allows 
a violation of each constraint by e times the 1 -norm of the constraint, and second, the running 
time is pseudopolynomial in e _1 . One may wonder if either characteristic can be improved. For 
example, one might ask for constraint violations that are at most e, independent of the 1 -norm of the 
constraints. However this is not possible even for a fixed value of e, unless P=NP. For completeness, 
we include a detailed analysis of this fact in Section[A]of the Appendix. Intuitively, if we were allowed 
to approximately satisfy every constraint with an error that does not depend on the data, we could 
appropriately scale constraint coefficients so as to obtain exact solutions to NP-hard problems. 

Similarly, it is not possible to reduce the pseudopolynomial dependency on e -1 in general. The 
precise statement is given in Section[A]of the Appendix as well, and the intuitive reasoning is similar: 
if there was a formulation of size polynomially dependent on log(e _1 ) (and not on e -1 ) we could 
again solve NP-hard problems in polynomial time. 
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3 Network mixed-integer polynomial optimization problems 

Here we return to the network polynomial optimization problems presented in the introduction, 
and provide a proof of Theorem [7] We will first motivate the technical approach to be used in this 
proof. Consider an NPO instance with graph G. For each u G V(G), X u denotes the set of variables 
associated with u and deg G (it) is the degree of u. At each u G V{G) we have a set of polynomial 
constraints of the general form 

£ p*„(. X u ux v ) >0, k = 1,, K u (12) 

{u,v}e5(u) 


k 

u.v 


is a polynomial (possibly K u = 0). Without loss of generality 
no two polynomials p^ v {u, c} G 5(u ) in (12) have 
a common monomial. If that were not the case, we could always combine the common monomials 
and assign them to a single v . This allows us to have 


associated with u, where each p„ 
we assume that, for every u and k = 1,..., K., 


V v k 

/ j yu,v 
{u,v}£S(u) 


£ 

{u,v}e5(u) 


| p k | 


k — 1,..., A i. 


Now define 


A = max { \X V \ + K v ). and 
vev(G) 

D = max {deg G (r;)}. 
v&V{G) 


(13) 

(14a) 

(14b) 


As discussed above, we cannot reduce Theorem [7] to Theorem [4] because even if the graph G un¬ 
derlying the NPO problem has small tree-width, the same may not be the case for the intersection 


graph of the constraints: a constraint (12) can yield a clique of size |<5(u)| in the intersection graph 
(or larger, if e.g. some \X V \ > 1). In other words, high degree vertices in G result in large tree-width 
of the intersection graph. 

One immediate idea is to employ the tech¬ 
nique of vertex splitting Suppose u G V(G) 
has degree larger than three and consider a par¬ 
tition of S(u) into two sets Ai, A 2 . We obtain 
a new graph G' from G by replacing u with two 
new vertices, u± and u 2 , introducing the edge 
{u\,U 2 } and replacing each edge {u 7 v} G Ag 
(for S = 1,2) with {us,v}. See Figure[3] 

Repeating this procedure, given u G V{G) 
with deg G (zt) > 3 we can replace u and the set of edges 5{u) with a tree, where each internal vertex 
will have degree equal to three. To illustrate this construction, consider the example on Figure |4] 
In this figure, a degree-5 node u and the set of edges {(u,Vi) : 1 < i < 5} is converted into a tree 
with three internal vertices (r, a and b) and with each edge (u,Vi) having a corresponding edge in 
the tree. To keep the illustration simple, suppose there is a unique constraint of the form 



Figure 3: Vertex splitting. 


5 

Pu,vj (V M u X Vi ) > 0. (15) 

i= 1 

associated with u in the given NPO. This constraint can be transformed into the following system 
1 See [27| for a column splitting technique used in interior point methods for linear programming. 
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Figure 4: Example of complete vertex splitting. 


of constraints: 


Pu ,V 2 (X U UX V2 ) 

Pu,V 3 ^-^-U U X v3 ) 
(Kmlli + \\Pu,v 3 \\i){yt -Va) 


Pu ,v 4 (X u U X V4 ) 
Pu,V 5 ( X U U Xy 5 ) 

(Kmlli + \\Pu,vAi){yb -Vb ) 


Pu,v± ( X u U X Vl ) 


Vr 


\\Pu,v 2 \U (Vv 2 - Vv 2 ) (16a) 

\\Pu,v 3 \\i (yt 3 - Vv 3 ) (16b) 

\\Pu,v 2 \\l (Vv 2 - Vv 2 ) + \\Pu,v 3 \\l(yt 3 - Vv 3 ) (16c) 

Kmlli (yti - VvJ (16d) 

\\Pu,v 5 \\i (yts - Vv 5 ) (16e) 

\\Pu, Vi \\i {vX - VvJ + \\Pu,v 5 \\i{yv 5 - Vv 5 ) (I6f) 

IK,mill (vX - VvJ (16g) 

IK,mill (Pvt - 3/m) + (Kmlli + Kmlli)0/o ~Va) 

+ (IK,mlli + IK.mlli)^ - Vb) (l 6h ) 


0 <y + ,y~ < l. 


Clearly, substituting (15) associated with u with the equivalent system (16) yields an equivalent 
NPO where 


X a = X U U {y+, y a , J/+ , y V2 , y+ , y V3 } 
X b = X u U{y^,y b ,y+ i ,y- i ,y+ v y-J 
X r = X u U{y+,t/+,y“} 


and constraints (16aI, (16bI and (16c) are associated with vertex a, constraints (16d), (16e) and 
(16f) are associated with vertex 6, and constraints (16g) and (16h) are associated to vertex r. It is 
important to notice that we do not associate variables and y~ with nodes ly, but rather with 
internal vertices of the tree. The motivation for this detail is that vertices Vi may have degree 
greater than three, and would (later) be split as well. Thus, adding new variables to X v . might 
create difficulties when defining the general procedure. This is avoided by keeping the same sets X Vi 
associated with after a neighbor of Vi is split, as in the previous example. 

We indicate the general formal procedure next; however we warn the reader in advance that this 
strategy may not directly deliver Theorem [7] because the splitting process may (if not chosen with 
care) produce a graph with much higher tree-width than G. This is a technical point that we will 
address later (Section [3.0.4 ). 

To describe the general procedure we use the following notation: given a tree T, an edge of T is 
called pendant if it is incident with a leaf, and non-leaf vertex is called internal ; the set of internal 


11 
















vertices is denoted by int{T). Now fix a vertex u of G such that deg G (w) > 3. Let T u be an arbitrary 
tree where 


• int(T u ) n V(G) = 0, T u has deg G (w) leaves, and for each edge {u,v} G E(G) there is one 
pendant edge {i,v} of T u , and 

• each internal vertex of T u has degree equal to three, 

Then completely splitting u using T u yields a new graph, G where V(G) = (V(G) — u) U int(T u ), 
and E(G) = (E(G) — Sg(u)) U E(T u ). 

To obtain an NPO system in G equivalent to the original system, we replace each constraint (12) 
that is associated with u with a family of constraints associated with the internal vertices of T u . To 
do so, pick an arbitrary non-leaf vertex r of T u and view T u as rooted at r (i.e., oriented away from 
r), and define, for each internal vertex i of T u and k = 1,..., K u . 


= 


E{i 


|Pu «l|i : ( u > v ) G 6 (u), v a descendant of i in T u 


Then, for each internal vertex i of T u we have 

(a) All variables in X u are associated with i. 

(b) For each k = 1,..., K u . if i ^ r we additionally associate two variables, yf k , y~ k G [0,1], and 
if i = r we only add y+ k G [0,1] with i. If i has a child v that is a leaf , and hence {iz, u} G S(u), 
then we associate two additional variables y+ k , y~ k G [0,1] with i. 

(c) If i ^ r, then letting j, l be its children we write the following constraints associated with i: 

Vj{yt,k ~ yj,k) + v i(vt,k - y^k) = y?, k - 1 /,"*) 1 < k < K n■ ( 17 ) 

If on the other hand i = r, let j, l, h be its children. Then we write 

"rllr,k - "s(yt,k - Vs,k) = 0 1 < k < R n- ( 18 ) 

s=j,l,h 


Finally, given a leaf v of T u with parent i, then by construction {w,t>} G S(u). Then we add the 
following constraints, associated with i (not w), 


\\p k uji(y+ k ^ Vv.k) - p k uJXu ui„) = o i < k < k u 


(19) 


Let us denote the initial NPO by P. Since the sum of constraints @ and @9) is @ 
we have obtained an NPO equivalent to P. Note that for any v G V{G) — u the degree of v is 
unchanged, as is the set X v and the set of constraints associated with v. Thus, proceeding in the 
above manner with every vertex u G V(G) with deg G (it) > 3 we will obtain the final graph (which 
we denote by G') of maximum degree < 3 and an NPO, denoted by P' which is equivalent to P. 

The following lemmas lay out the strategy that we will follow to prove Theorem [T] To prepare 
for these, we need a technical remark, which follows from the definition of tree-decomposition. 


Remark 19 Suppose that (T,Q) is a tree-decomposition of a graph H. Suppose a subset of vertices 
S induces a connected subgraph of H. Then 


{t G V(T) : Q t O A ^ 0} 


induces a subtree of T. 

Returning to our construction, we assume that we have a tree decomposition (T\Q r ) of G'. 
Note that each vertex v of G' is derived from some vertex, say u(v) G V(G) and the set of variables 
associated with v under P' is a copy of X v / v \, together with the y + and y~ variables introduced 
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above. We have one pair of such variables per each constraint (12) associated with u(v) and an extra 
pair when v is the parent of a leaf in T u ( v \. 

We consider now the following pair: ( T',Q '), with each Q' t t £ T' defined as follows. For each 
v £ Q' t , the set Q' t will include (1) X u i v ) together with the y + and y~ variables associated with v 
or its children in T u r v y This can include up to \X V \ + 7 K v variables for each v (the bound is tight 
at the root r which is the only internal vertex with three children; further we are counting yf ~). (2) 
If v is the parent of leaf in T u ^ v y say w, then we also add to Q[ the set X u / W y Further, the number 
of constraints associated with internal vertices is clearly < 3K V . 

Lemma 20 The pair (T',Q') is a tree-decomposition of the intersection graph of P 1 . 

Proof. We need to show that (a) for any variable x :) of P'. the set of vertices t of T' such that Q' t 
contains Xj forms a subtree of T', and (b) that for any edge { Xk , Xj} of the intersection graph of P' 
the pair x^,Xj are found in a common set Q' t . 

Part (b) follows directly from the construction of the sets as each of the constraints 
(18) and (|19|) involve only one internal node and its children, including the case when some children 


are leaves, which is accounted for in (2). 

As for part (a), the statement is clear if Xj is one of the y + ,y~ variables. If, instead, Xj is 
contained in some set X v ( v ^ then Xj is associated with every vertex in T u y,\. By definition of NPOs, 
the set of w vertices of G such that x :) £ X w forms a connected subgraph of G. It follows that the 
set of vertices v of G' such that x :) is associated with v in P' forms a connected subgraph of G'. 
Part (a) now follows from Remark 


19 


Lemma 21 Suppose that G' has tree-width < W. Then the intersection graph of P' has tree-width 
< 7A(W + 1) - 1. 

Proof. Consider a tree-decomposition (T r , Q') of G' of width < W and construct Q' as before. We 
claim that the width of ( T',Q ') is at most 7A(W + 1) — 1. But this is clear since \Q' t \ < W + 1, 
and each v £ Q' t will contribute at most three extra sets of the type X w for some w, along with the 
\X V \ + 7 K v quantities stated before. Hence, 


\Q't\ <7A(W + 1) 


as desired. 


Lemma 22 Given 0 < e < 1, there is a linear programming formulation LP for the NPO problem 
on G, of size 0 ((Dtt/ e)° < - AW ' ) n loge -1 ), that solves the problem within scaled tolerance e. 


Proof. Suppose we apply Theorem 15 to problem P', for a given tolerance 0 < 9 < 1. Note 
that, in order to do so rigorously, equations and ( |19[ ) must be first transformed to 2 inequality 
constraints each. 

The resulting linear programming formulation, LP, will have size O((27 t/ 0)°^ am/ ^ n log# -1 ) and 
will yield ^-approximate solutions to P’. In particular, these solutions may violate a constraint of 
P’ by an amount proportional to the 1-norm of the constraint, times 0, as guaranteed by Theorem 

m 


Consider a constraint (12) of problem P, associated with a vertex u of G, with 1-norm denoted 
by \\Pu\\i- In P' this constraint has been replaced by a family T of constraints, one per each vertex 
of T u whose sum yields (12). Simple algebra shows that the solution to LP may violate (12) by an 


amount equal to the sum of violations of the constraints in T. The 1-norm of each such constraint 
is at most 4||p^||i, since if i is an internal node of some tree T u with children j,l then 

v’f = i, fc + I/* 


2 In the case of a constraint that had the form p{x) = 0 in P 1 , and which was expressed as two inequalities, it 
means that a 0-scaled feasible vector x will satisfy 

p(x) > — 0||p||i and — p(x) > —0||p||i and thus |p(£)| ^ 1 
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and for a root node r with children j, h , l 


\\p k u\\l=^ = Vj+Vh+Vl 


by (13). Since T u has deg G (u) leaves and all internal vertices have degree 3, it can be easily shown 
that it must have less than 2deg G (u) vertices in total. From this it follows that the LP solution 
violates (12) by at most 8||p„||i deg G (u)6f Choosing 9 = (8U) -1 e, and noting that for a constant c 

{cDn/e)° (AE) G 0 (D7r/e) o( - AE ' > ) 


yields the desired result. ■ 


As per Lemma 22 these constructions obtain Theorem [7] provided that we find a vertex splitting 
with W = O(w). As discussed in the next section, this condition may fail to hold under an arbitrary 
vertex splitting, and some care is needed. 


3.0.4 Finding good vertex splittings 

Consider the example given in Figure [5] Here the vertices of the graph shown in (a) are arranged 



.tutu 

1 2 k 

(b) 


Figure 5: Incorrect vertex splitting. 


into k > 1 columns. The odd-numbered columns have k vertices each, which induce a path, while the 
even-numbered columns have a single vertex which is adjacent to all vertices in the preceding and 
following columns. It can be shown that this graph has tree-width 3. In (b) we show the outcome 
after splitting vertices so that the maximum degree is four. This second graph has tree-width fc, and 
further splitting the degree-four vertices will not change this fact. 





Second step 


Figure 6: Correct vertex splitting of graph in Figure [5^ a). 

In contrast to this situation, suppose that we split the graph in Figure [5ja) in two steps as 
shown in Figure [6] The tree-width of the final graph is also 3. The difference between Figures [5] 
and [6] is explained by the fact that the splitting initiated by the “first step” in Figure [6] exploits the 
tree-decomposition of width 3 of the graph Figure [5ja). 

Next we turn to a formal approach that produces the desired outcome in the general setting. 
Given a graph G, a simplification of G will be graph G obtained by a sequence of complete vertex 
splittings, such that the maximum degree of a vertex in G is < 3. The following Lemma will show 
how to obtain a simplification of a graph via a vertex splitting that maintains tree-width up to a 
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constant factor. In the proof, the trees T u that yield the general splitting procedure stated above, 
will be explicitly defined. 

Lemma 23 Let G be an undirected, graph and ( T , Q) a tree-decomposition of G of width Z. Then 
there is a simplification G of G and a tree-decomposition (T,Q) of G of width at most 2 Z + 1. 

Proof. We first modify (T, Q) in a sequence of steps. 

Step 1. For any edge e = {u, £ E{G), choose an arbitrary t £ V{T ) with e C Q t . Then we 

modify T by adding to T a new vertex, t e and the edge {t e ,t}. Further, we set Q t t> = {«,*'}• 

Step 2. Without loss of generality, every vertex of T has degree at most 3. To attain this condition, 
consider any t £ V{T) with Sx(t) = {si,... , Sd} (say) where d > 3. Then we alter T by replacing 
t with two vertices adjacent vertices t 1 and t 2 , such that t l is also adjacent to Si and S 2 and t 2 is 
adjacent to S 3 ,... ,Sd■ Finally, we set Q t 1 = Qt 2 = Qt- Continuing inductively we will attain the 
desired condition. 


Step 3. For any vertex u £ V{G ) let T u be the subtree of T consisting of vertices t with u £ Qt, and 
T u be the subtree of T u that spans {t e : e £ Sq(u)} (which is a subset of the leaves of T u ). Then 
we modify (T, Q) by replacing T u with T u , yielding a new tree-decomposition of same or smaller 
width. In other words, without loss of generality every leaf of T u is of the form t e for some e £ 5g{u). 


We can now describe our vertex splitting scheme. Consider u £ V(G) with deg G {u) > 3. We 
say that a vertex of T u is blue if it is either a leaf or of degree three in T u . Now we form the tree 
T u whose vertex-set is the set of blue vertices of T u , and whose edge-set is obtained as follows. 
By construction, E(T U ) can be partitioned into a set of paths whose endpoints are blue and which 
contain no other blue vertices. For each such path, with endpoints a and b (say), the tree T u will 
contain the edge {a, b} (in other words, T u can be obtained from T u by subdividing some edges and 
so T u and T u are topologically equivalent). Note that T u has deg G (w) leaves, each internal vertex 
with degree 3, and for each edge {u,v} £ E{G) there is one pendant edge, as needed. 

Let G be the graph obtained by the complete splitting of u using T u . For each internal vertex 
t £ V(T u ) we name vf the corresponding new vertex in G, to emphasize that each non-leaf vertex in 
T u will create a copy of u (recall that the leaves in T u will correspond to the neighbors of u). This 
operation does not change the degree of any vertex v £ V(G ) with v 7 ^ u. The eventual graph G in 
the proof will be obtained by applying complete splittings at every vertex of degree > 3 in G. 

Returning to G we construct a tree decomposition (T, Q) as follows. First, let us regard the tree 
T u as rooted at some internal blue vertex r(u). For a vertex t £ V(T U ) let R u (t) be the closest blue 
ancestor of t in T u ; we write R u {r(u )) = r{u). Then, for t £ V(T), we set 

( {Q t - u) U {u R ^}, if t £ V(T U ) and deg Tu (f) = 2 , 

Qt = < {Qt - u) U if t £ V{T U ) and deg T {t) ^ 2, (20) 

{ Qt, if t£V{T u ). 


Now we argue that {T,Q) is a tree-decomposition of G. To see this, note that if t £ V{T U ) then 
u* € Q s iff s = t or s is a child of t in T u , thus the endpoints of any edge {u l ,u s }, where t is the 
parent of s, will be contained in Q s . Further, for any edge of e = {u, w} of G, by Step 3 above there 
will be a leaf t e of T u such that the edge {t e ,R u {t e )} £ E{T U ). This corresponds to a pendant edge 
{ u ft(t e ), ?; } g E{G) and by construction both v £ an d g The fact that every vertex 

in G induces a connected subgraph in T can be easily verified. This completes the argument (T, Q) 
is a tree-decomposition of G. 

Notice that for v £ V{G) with v 7 ^ it, the subtree T v is the same in (T,Q) and {T,Q). Thus, 
applying the complete splitting of every vertex of G of degree greater than three, and modifying the 
tree-decompostion as in (20) will produce a tree-decomposition (T, Q) of the final graph G. 

By construction, for each t £ V(T) we obtain Q t from Q t by replacing each element with (at 
most) two new elements. Thus, since \Qt\ < Z + 1, the width of {T,Q) is at most 2 {Z + 1) — 1. ■ 


15 



4 Pure binary problems 

In this section we will consider Theorem [9] of the Introduction. As we mentioned above, it is one 
of the building blocks towards the other main results, but is of independent interest as well. We 
will provide additional background, a deep analysis of this result, and state and prove an expanded 
version of the Theorem. First we begin with some examples for problem GB. 


Example 24 (Linear binary integer programming). Let A be an m x n matrix, and consider a 
problem min{c T a; : Ax > b, x £ {0,1}"}. To view this problem as a special case of GB, we set for 
1 < i < m, K[i\ = {1 < j < n : a,j / 0} and 5[z] = {x £ {0,1} A 'M : J2 jeK ^ aijXj > 6J. 

In this special case, problem GB can be addressed by a variety of methods. Of particular interest 
in this paper are the reformulation or lifting methods of |42| and m- Next we consider a more 
complex example, chosen to highlight the general nature of the problem. 

Example 25 Let d,r,p be positive integers. Consider a constrained semidefinite program over bi¬ 
nary variables of the form 


r d d 

min EEE (21a) 

k—1 i—1 j= 1 

subject to: M k • X k = bk, 1 < k < r, (21b) 

X k £ S+, 1 <k<r, (21c) 

^ X k j = 0 mod p, 1 < k <r, (2 Id) 

X k - { = X k j\ 1 < * < d, 2<k<r, (21e) 

X k j G {0,1}, \/i,j,k. (21f) 


Here S)f is the set of dx d positive-semidefinite matrices, Mi ,..., M r are symmetric dx d matrices, 
and b and c are vectors. Constraint (21e) states that the first column of matrix X k is identical to 
the last column of matrix X k ~ 1 . 

We obtain an instance of problem GB with m = 2r — 1, as follows. First, for each 1 < k < r we 
let K[k\ be the set of triples ( i,j , k ) with 1 < i,j < r, and S k to be the set of binary values X k j that 
satisfy { 21b]) -(21 d). Next, for each 2 < k < r we let K[r + k — 1] be the set of all triples (i, 1, k — 1) 
and all triples (i,d,k) and S r+k to be the set of binary values (indexed by K[r + k — 1\) such that 
(21 e) holds. 


In the case of this example, a direct application of standard integer programming methods appears 
difficult. Moreover, we stress that the sets S l in problem GB are completely generic and that the 
membership oracle perspective can prove useful as we discuss below. 

Theorem [9] concerns the tree-width of the intersection graph of a problem of type GB. Recall 
that as per Definition [I] given a problem instance I of GB, the intersection graph for I has a vertex 
for each 1 < j < n, and an edge {j,k} whenever there exists 1 < i < m such that {j,k} C K[i], 
that is to say, j and k appear in a common constraint in problem GB. 


Example 26 (Example |^5[ continued). Here the set of variables is given by 

{( i,j,k ) : 1 < k < r and 1 < i,j < d}. 


The intersection graph of the problem will have 

(a) the edge {(i, j,k), (i',j',k)} for all 1 < k < r and 1 < i,j,i',j' < d, arising from constraints 

(b) the edge {(i,l,k),(i,d,k — 1)} for each 1 < k < r and 1 < i < d, arising from constraints 

(2ld[). 
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A tree-decomposition (T,Q) of the intersection graph, of width 0(d 2 ), is obtained as follows. Here, 
T is a path with vertices vi,U 2 ,V 2 ,U 3 ,... ,v r -i,u r ,v r . For 1 < k < r we set Q Vk = {(i, j,k) : 1< 
i, j < d} and for 2 < k < r we set Q Uk = {(z, 1, k), [i, d, k — 1) : 1 < i < d}. Sets Q Vk account for 
all edges of type (a), whereas the sets Qu k cover all edges of type (b). Thus Theorem^2^ states that 
there is an LP formulation for problem (21) with 0(2 d d 2 r) variables and constraints. 


We now state the main result we will prove regarding problem GB, which implies Theorem [9] (a). 
A proof of part (b) of Theorem [9] is given in Section [B] of the Appendix. 


Theorem 27 Let ( T , Q) be a tree-decomposition of the intersection graph of a problem GB. Then 

(a) There is an exact (continuous) linear programming reformulation with 0(J^ t 2^*1) variables 
and constraints, the same objective vector c and constraints with {0,1, — 1 }-valued coefficients. 

(b) The formulation can be constructed by performing 2 oracle queries and with additional work¬ 
load 0 *(cv 2 u J2 t £V(T) I {* ' — Qt} I +wmn), where the notation indicates logarithmic 

factors in m or n. 


As a corollary, if the width of ( T,Q ) is to, the formulation has 0{2 ul n) variables and constraints. 
Hence for each fixed u> the formulation has linear size. 


The “corollary” statement follows because if an n-vertex graph has a tree-decomposition of width 
w, say, then it has one with the same width and where in addition the tree has at most n vertices 
(see Remark |30j below). To illustrate, we show what this result implies when applied to one of our 
previous examples: 


4.0.5 Remark. Reduction to the linear case 

Consider a problem instance of GB. An apparently simpler alternative to the general approach we 
follow would be to construct, for 1 < * < m, the polyhedron 

Pi = conv {a: e {0, T} m C R K M. 

Thus we can write Pi as the projection onto of a polyhedron {x £ [0,1]" : A l x > b 1 } where 

each row of A 1 has zero entries on any column not in K[i]. Thus the GB problem can be restated 
as the equivalent linear integer program 


min c T x (22a) 

subject to: A 1 x > b 1 , 1 <i<m (22b) 

x £ {0,1}". (22c) 


Switching to this formulation makes it possible to apply general integer programming methods to 
problem GB. However, this analysis ignores the size of formulation (22). In particular, for integer 
d > 1 large enough there exist examples of 0/1-polytopes in M. d with at least 


d \ d/4 

logd; 

facets (up to constants). See [7], [25], [3J. Using this observation, one can construct examples of 
problem GB where the tree-width of the intersection graph is w = d — 1 and each of the matrices A‘ 
has more than w“^ 4 rows (see Example |28| below). This dependence on w makes any classical integer 
programming method more computationally expensive than using the method presented above. 


Example 28 Choose d > 2 large enough so that there is a 0/ 1-polyhedron P C ]R d with more 
than (cd/ log d) d//4 facets for some c. Let P be given by the system Ax > b, where A is M x d 
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(M > (cd/\ogd) d ^). Choose N > 1 , and consider the system of inequalities over binary variables 
x l j, for 1 < i < N and 1 < j < d: 


Ax 1 > b, 1 < * < N, 


Xa = Xa 


2 <i<N, l<j< [d/2\. 


x 1 a binary for all i and j. 


(23a) 

(23b) 

(23c) 


Constraint (23a) indicates that this system includes N copies of polyhedron P, with each copy de¬ 


scribed using a different coordinate system. Constraint (23b) states that the first [d/ 2J coordinates 
take equal value across all such systems. 


Any linear program over (23) is can be viewed as an example of problem GB with m = 2 N — 1; 


for 1 < i < N, K[i\ is used to represent the d variables x* (1 < j < d) and S l is a copy of the set 
of binary points contained in P (i.e. the extreme points of P). 

The intersection graph of this instance of GB will be the union of N cliques (one for each set 
of variables x l ) plus the set of edges {x\,x\} for 2 < i < N. A tree-decomposition (T,Q) of this 
graph, of width d — 1, is as follows: T has vertices u(0), as well as u(i) and v(i), for 1 < i < N. 
Further, Q u(0) = {x) : 1 < j < [d/2 \}; and for 1 < i < N = Q u(0) U {x* : 1 <j< [d/2 \} 
and Q v (i) = {x*-, l < j < d}. Thus, w = d and Theorem 21 states that any linear objective problem 
over constraints l(23\) can be solved as a continuous LP with 0(2 d dN) variables and constraints. In 
contrast, system 1 (2,A has more than (cd/ log d) d ^N constraints, and in particular the same is true 
for the level-d RLT reformulation. 


As the example shows, formulation (22) may be exponentially larger than the linear program stated 
in Theorem l27l 


4.1 Proof of Theorem 


In this section we discuss a construction that yields Theorem 27 by relying on methods from [39] . 
Fundamentally the construction employs the “cone of set functions” approach of 02] (also see [35]), 
together with an appropriate version of the “junction tree theorem” m as developed in 3!T. In 
addition, we provide a second formulation in Section 0 of the Appendix, and a direct proof. 

Consider an instance of problem GB. Let G be the corresponding intersection graph, and (T, Q) 
be a tree-decomposition of G of width io. We begin with some general remarks. 


Remark 29 Suppose that (T, Q) is a tree-decomposition of a graph G. Then for any clique K of G 
there exists t £ V(T) with K C Q t . 


As a result, for 1 < i < m there exists t £ V(T) with K[i\ C Q t , i.e, the indices of the support of 
each constraint must be contained in some node of the tree-decomposition. 


Remark 30 Without loss of generality, |P(T)| < n. To see this, note that the tree-decomposition 
(T, Q) gives rise to a chordal supergraph H of G. Since H is chordal, there exists a vertex u whose 
neighbors (in H) induce a clique. The claim follows by induction applied to the graph H — u, using 
Remark \2S\ and noting that a tree-decomposition of H is also a tree-decomposition ofG. 

Definition 31 Let t £ V(T). 

(a) We say that v £ {0,1}®* is -feasible if vk[{\ G S l for every 1 < i < m such that K\i\ C Q t . 

(b) Write F t = {v £ {0,1}^* : v is Qt-feasible}. 


Now we can present the formulation. The variables are as follows: 

• A variable A*, for each t £ V(T) and each vector v £ F t . 

• A variable Z§, for each S £ 2 V( C" > such that S C Q t for some t £ T. 
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We also write 

V = V(G) = the set of all variables indices for the given instance of GB, 
so that \V\ = n. The formulation is as follows: 


(LPz): 


n 

min Y, C > Z ifi 
j =1 


s.t. Vi G V(T) : 

: w e Ft, Vj = 1 Mj G 5} VS c Q t 

E A ‘ = !» A ‘>°- 

v£F t 


(24a) 

(24b) 

(24c) 


Constraints (24b) enforce consistency across different t G V(T). In fact the Z variables can be 
eliminated with (24b) replaced with relationships among the A variables. Constraint (24b) can be 


restated in a more familiar way. Given t G T, (24b) states: 


Z s = xlc uMv ’ Qt \ 

vEF(t ) 


VSG 2 Qt . 


(25) 


Here, given a set Y and a vector w S l 5 , supp(w,Y) = {j G Y : Wj 0}, and for any set Y and 

p CY the vector £ p G {0, l} 2 ' is defined by setting, for each q C V, 


C p = 

So 


1 if q C p 
0 otherwise. 


Constraints (24b)-{24c) describe the Lovasz-Schrijver approach to lifted formulations, restricted 


to a given set Qt- It is clear that LPz amounts to a relaxation for the given problem GB, in the 
sense that given x feasible for GB then there is a vector [Z,, A) feasible for LPz where Z^ = x 3 for 
1 < j < n. To do so, let t G V(T) and denote by x 4 the restriction of x to Q t . Then by definition 
we have that x* G F t . Thus we can set A| t = 1 and A* = 0 for any other v G F t , and for any S C Q t 
Zg = £g Upp ( x ' xhe last equation simply states that Zg = 1 iff S C supp(x,V), a consistent 
definition across t G V(T). Hence indeed (Z, A) is feasible for LPz and attains Zij i = Xj for each 
j G V, as desired. Note that, effectively, we have argued that the restriction of £ su pp( x ’ V ) ]g> 2 Qt 
for t G V{T) yields a feasible solution to LPz. 

Next we argue that (24b)-(24c) defines an integral polyhedron. This is a consequence of the 


following result, which can be obtained from Lemma 8.18 of [39], although it is not stated there in 


the language of constraints (24b)-(24c). 


Lemma 32 Suppose that (Z, A) is a feasible solution to (24b)- \24q ). Then there exists a vector 
W G R 2 , nonnegative values 6±, ..., 9^ and vectors y 1 ,..., y k in such that: 

(1) Zg = Wg, for all S G U t6V( T)2°*. 

( 2 ) Ei=i °i = 1 - 

(3) y 1 is feasible for GB, for 1 < i < k. 

(4) w = c supp(y, ’ y) . 

As a consequence of (2)-(4), the vector W is a convex combination of the vectors Q su ppIv Y) 
as argued above yield feasible solutions to LPz, thus yielding the desired result. We remark that the 
proof of Lemma 8.18 of [39] is related to that of the tree-junction theorem; this technique, evocative 
of dynamic programming, was also used in [12] in a closely related setting. 

To complete the proof of Theorem [27] we note that the total quantity of variables and constraints 


in LPz is 0(J2t^v(T) 2^ 4 ') = 0(|V(T)|2“). This yields part (a) of Theorem 
using standard algorithmic techniques. 


27 


part (b) follows by 
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A Dependence on e in Theorem 15 


In this section we will prove that the two characteristics of Theorem 15 regarding e (approximation 
notion and running time) cannot be improved. 

First, suppose that there is an algorithm A such that any PO whose intersection graph has tree- 
width < 2 can be solved in polynomial time to some given feasibility tolerance e < 1, that is to say 
the algorithm guarantees fi(x) > — e for any constraint fi(x) > 0. Note that since e is fixed in this 
case, the formulation in Theorem [l5] yields an algorithm that runs in polynomial time (see the result 
on Theorem 27 for the time it takes to build the LP formulation) but with a weaker approximation 
guarantee than the hypothetical algorithm A. 

We claim that the existence of algorithm A implies P = NP. Consider the subset-sum problem: 
given n > 2 positive integers a±,... ,a n find / C {1,..., n} such that X/, 6 / a i = °J• Denoting 


S = - '^2 a j and Af = 4?rS 1 , 

Z 3 =1 


the subset-sum problem can be cast 

as the following (pure feasibility) PO: 


MSyi 

= 

Ma 1 * 1 , 

(26a) 

MSyi 

= 

MdiXi + MSyi- 1 ) 2 < i < n, 

(26b) 

My n 

= 

M, 

(26c) 

Mxj(l — Xj) 

= 

0, 1 <j<n, 

(26d) 

0 < y 3 

< 

1 , 1 < j < n, 

(26e) 

■Os 

H 

VI 

o 

< 

1 , 1 < j < n. 

(26f) 

Given a solution ( x , y) to (f2G| it is clear that x G {0,1 }” and that X)/ a j x j = \ J2j= l a j 
the intersection graph of (|26|) has tree-width 2. 

By assumption, algorithm A will produce a solution (x, y) that violates each of the 

Moreover, 

constraints 


(26a)-(26d) by at most e and that satisfies (26e)-(26f). Then adding (26a)-(26c) yields 


E “Aj - s 

3 =1 


< 


en 
M' 


(27) 


Moreover, by (26d) and (26f) for each 1 < j < n, 


either 0 < Xj < or 1 — || < Xj < 1. 

[This follows from the fact g{x) = x(l — x) is strictly increasing in [0,1/2), strictly decreasing in 
(1/2,1], and g(2e/M) > e/M.\ Thus, suppose we round each Xj to the nearest integer, obtaining 
binary values Xj for 1 < j < n. Using (27) we obtain 


E 

3 =1 


cijXj — S 


en i \ ^ 

i M+ E' 


v 1— 1 


2e 

M 


en 

M 


4 Se 


and therefore 


E “Aj ~ S 

j =i 


< 1. 


Since the left hand side of the inequality must be an integer, we conclude that 


E 

j=i 


cijXj = S; 
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which proves that unless P = NP algorithm A does not exist. 

Next, suppose that now that there is an algorithm A that, for any e < 1 solves PO problems to 
scaled tolerance e (i.e. the violation of any constraint fi{x) > 0 is at most e||/||i) but whose running 
time is polynomial, i.e. in particular it depends polynomially on loge^ 1 . This is in contrast with 
the formulation in Theorem 15 yields an algorithm that runs time polynomial on n, m and e — . 
Consider an unsealed version of the previous formulation of the subset-sum problem, i.e: 

(28a) 

! < i < n, (28b) 

(28c) 
(28d) 
(28e) 
(28f) 

Define e = 1/(3SM) and use algorithm A to find a solution (x,y) that is scaled-e-feasible. Since 


Syi 

= 


Syt 

= 

OiXi + Syi- 1 

Vn 

= 

1, 

(! - Xj) 

= 

0, 1 < j < n 

VI 

o 

< 

1, 1 < j < n 

0 < Xj 

< 

1, 1 < j < n 


the 1-norm of any polynomial in constraints (28) is at most 2 S + 1, we get that for each constraint 
fi(x,y ) > 0 

fi{x,y) > —e||/i||i > ~e(2S+l) > 

This way we can reuse the same argument as before to obtain a solution to the subset-sum 
problem. Since we assume the running time depends on log^ 1 ) we get a running time that depends 
polynomially on log(nS') yielding the same contradiction as before. 


B Proof of part (b) of Theorem [9] 


Here we describe a procedure that constructs formulation (24) which requires 2 “m oracle queries 
and with additional workload 0*(2 ul uj(n + to) + wran), as per Theorem [9] (b). Here, as per the 
formulation, we have a tree-decomposition ( [T , Q) of the intersection graph of a problem GB , of 
width oo. The critical element in the procedure is the construction of the sets F t used in equation 
(24b), and we remind the reader of Definition 1311 and that for 1 < i < m constraint i has support 


K[i] and the set of feasible solutions for constraint i is indicated by S l C {0,1} A M. Note that 
\I<[i\ \ < oo for all i. The procedure operates as follows: 

1. For each constraint *, enumerate each partition of K[i\. Given a partition (Ai,Aq) if the vector 
V £C {0,1} A W defined by yj = k if j € Ak (k = 0,1) is such that y ^ S l (i.e., not feasible) then 
we record the triple (*, Ai, Ao) as a vector of length \I\ [i]| + 1 [with some abuse of notation]. This 
process requires 21I*] I oracle queries. This sum of all these quantities is 0{2 ul m) but the more 
precise estimate will be needed. 

2 . Let £ be the list of all vectors recorded in Step 1, sorted lexicographically; first by the index i, 
then by A\ and then by A$. After postprocessing if necessary, we can assume that C contains no 
duplicates. These can be performed in time 0(uj\C\ log |£|) = 0(2 ul m{uj + log to)). 

3. For each t e V(T) construct a list of all constraints i such that K[i] C Q t . This can be done in 
time O(ojmn). 

4. For each t £ V(T) we form the sublist of £ consisting of all vectors (i,Ai,Ao) (constructed in 
Step 1) such that K[i\ C Q t . Note that for any such i the total number of such vectors is at most 
2 l A MI. Given a vector (i,Ai,Ao) thus enumerated, we form all vectors of the form (A\,A' 0 ) such 
that A! x U A ' 0 = Q t and A k C A' k for k = 0,1. Let £ l be the list of all vectors obtained this way. 
Clearly, |£*| < 2“| {i : K[i] C Q t } |. We lexicographically sort £ l . 

5. For each t £ V(T) we enumerate all vectors y £ {0,1}^*. For any such vector y , we have that 
y £ F t if and only if y is not found in the list and this test can be performed in time 0(wlog 1/1*1) 
after lexicographically sorting the list. 
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The total amount of work entailed in Step 4, using \Q t \ < ui for each t £ V(T), is 

t t 

Likewise, Step 5 requires 0(u 2" log \£ t \) — 0 *(uj 2 2“). This completes the proof. 


C Alternative formulation for Theorem 27 


Here we construct a second formulation that also yields the result of Theorem |27[ The number of 
constraints and variables in this new construction is upper bounded by those for LPz. 


C.l Additional Definitions and Second Formulation 

Definition 33 Lett £ V(T). We let fit denote the set of pairs ( Y,N ) withYCiN = 0, YUN C Q t , 
and such that 

1. \Y\ < 1 and |7V| = 0, or 

2. (' Y,N) partition Qt n Q' t , for some t' £ V(T) with {t,t'} £ E(T). 

The formulation is as follows. The variables are: 

• A variable A*, for each t £ V(T) and each vector v £ F t . 

• A variable X[Y, TV], for each pair ( Y, N ) £ 2 n x 2 n with ( Y, N ) £ fl t for some t £ V(T). 


n 

(LP-GB): min ^c,-A[{j},0] (29a) 

j =i 

s.t. Vt £ V(T) : 

x[y, n] = ^2 k, n n (! - v ( y - n ) e ( 29b ) 

v&F t j£Y jeN 

Y, x v = f> A ‘ > °- ( 29c ) 

vGF t 


We will show below that (a) LP-GB is a relaxation of GB and (b) the relaxation is exact and that 
the polyhedron defined by (29b)-(29c) is integral. 

Remark 34 (f.l) When ( Y,N ) partition Qt f\Qt' for some edge {t,t'} then variable X[Y, N] will 
appear in the constraint 1 29b\ ) arising from t and also that corresponding to t'. This implies 
an equation involving A* and A* . 

(f-2) The sum on the right-hand side of constraint \29b\) could be empty. This will be the case if for 


any v £ {0, l} 1 ^ 4 with Vj = 1 for all j £ Y and vj = 0 for all j £ N there exists 1 < i < 
with K[i] C Q t and yet Vk[i] ^ S 1 . Then (29b) states X[Y,N) = 0. 


(f.3) For v £ F t define Y = {j £ Q t : Vj = 1} and N = Qt — Y. Then (29b) states A[F,7V] = A*. 


(f.Jy) When Y = N = 0 the right-hand side of (29b) is J2 v eF t A * ■ Hence we will have A[0, 0] = 1. 


(f.5) The A variables are the same as those in formulation \21$ . For any edge {t,t '} £ E(T ) and 
Y C Q t fl Qt> the terms on the right-hand side of the row f29W are a subset of the terms on 


the right-hand side of the row (24b) corresponding to Y (additional statements are possible). 


First we show that LP-GB is a relaxation for GB, in a strong sense. 


26 










Lemma 35 Let x be a feasible solution to an instance for GB. 


(i) The re is a feasible, 0/1-valued solution (X,A) to (29) such that for each variable X[Y,N] in 


(29( we have X[Y,N] = Y\ jeY ^j Iljeivt 1 ~ %)• 


(ii) As a corollary Y( r j = iCjX[{j},9\ = c T x. 


/■(I %)■ 


Proof (i) For each variable X[Y, N] in problem (29 1 we set X[Y,N] = YljeY E' - 

Further, for each t £ V(T) let v(t) £ {0,1}^* be the restriction of x to Q t , i.e. v(t)j = Xj for each 
j £ Q t - Since x is feasible, v(t) £ F t . Then we set AL t ) = 1 and A* = 0 for every vector v £ F t with 
v 7 ^ v(t). By construction for every t £ V(T) and (' Y,N ) £ Q t we have X[Y,N] = 1 iff v(t)j = 1 for 
all j £ Y and v(t)j = 0 for all j £ N; in other words (29b I is satisfied. 

(ii) This follows from (i). ■ 

As a consequence of Lemma |35[ Theorem [27] will follow if we can prove that the constraint 


matrix in (291 defines an integral polyhedron. This will be done in Lemma 39 given below. In what 


follows, we will view T as rooted, i.e. all edges are directed so that T contains a directed path from 
an arbitrarily chosen leaf vertex r (the root of T) to every other vertex. If (v, u) is an edge thus 
directed, then we say that v is the parent of u and u is a child of v. 

Definition 36 A rooted subtree T is a subtree of T, such that there exists a vertex u of T so that 
T contains a directed path from u to every other vertex of T. We then say that T is rooted at u. 


Definition 37 Let T be a rooted subtree ofT. 

(a) We denote by D(T) the set Ufef 

(b) We denote by V(T) the set { j : j £ Qt for some t £ T}. 


Below we will prove the following result: 


Theorem 38 Let (X, A) be a feasible solution to the LP-GB problem (29). Then for every rooted 
subtree T there is a family of vectors 

p k ' f £ { 0 , 


vectors 

^,f £ {0; i}V(f) 

and reals 

0 < pi k ’ f < 1 , 

(fe = 1,2,..., n(T)) satisfying the following properties: 

(a) For each 1 < k < n(T) and each constraint 1 < i < m of problem GB, if K\i\ C Q t for some 
t £ T, then x k ^ £ S l . 

(b) For 1 < k < n(T) and each pair (Y, N ) £ f l(T), 

P’AVJV] = || || (i-**/)- 

jeY jeN 

As a result, for each 1 <k< n(T) and j £ V(T), x k ’^ = p k ’^[{j}, 0]. 

(c) E:2V fc ’ f i. 

(d) For each ( Y,N ) £ Ll(T), 

n(T) 

X[Y,N} = 

k= 1 
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The family of vectors p kT and reals p k,T will be called a decomposition of ( X, A) over T. 


Pending a proof of Theorem |38[ we can show that the polyhedron defined by the constraints in 
LP-GB is integral. 


Lemma 39 The polyhedron defined by 1 29b)- (29c ) is integral and problems GB and LP-GB have 
the same value. 


Proof. Let (A, A) be a feasible solution to LP-GB. We apply Theorem 38 with T = T obtaining 
a family of vectors p k £ {0, vectors x k £ {0,1|™ and reals p k , for 1 < k < n(r ), satisfying 

conditions (a)-(d) of the theorem. By (a) and RemarkD9] each vector x k is feasible for GB. By ( d), 
the vector X is a convex combination of the vectors p k . This completes the proof, using Remark 
(f.3) to handle the A* variables. ■ 
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This result completes the proof of Theorem D7l pending Theorem 38 


C.2 Proof of Theorem I 


Assume we have a feasible solution (A, A) to (29). The proof of Theorem 38 will be done by induction 
on the size of T. First we handle the base case. 

Lemma 40 If T consists of a single vertex u there is a decomposition of ( X , A) over T. 


Proof. We have that f 1(T) = fl u (see Definition 37). By (29c) we have Y1 v gf u = 1- Let n(T) > 0 


be the number of elements v £ F u with A“ > 0 and denote these vectors by {u’(l),... ,w(n(T))}. 
Then, for 1 < k < n(T) let x k A = w(k) and p k ’^ = A“, fc ^. Finally, for 1 < k < n(T ) we define the 
vector p k ' T £ Ll u by setting 


/’ T 


[y,jv] = || 

jeY j<=N 


for each pair ( Y,N) £ fl u . Now we will verify that conditions (a)-(d) of The orem 38 hold. Clearly 
(a)-(c) hold by construction. To see that (d) holds, note that (A, A) satisfies (29b), i.e., 


X[Y,N) = 


i k.T 
1 “ X 3 


yi n na - v ^ 

v&f u jeY jeN 

n(f) 

= E ^ II -f II ( 

k —i jeY jeN 

n(T) 

= a] 

fc=1 

which is condition (e), as desired. ■ 

Next we prove the general inductive step needed to establish Theorem [38] The technique used 
here is related to the junction tree theorem, is similar to one used in m and is reminiscent of 
Lemma L of 1391. 


Consider a vertex u of T and a subtree T rooted at u with more than one vertex. Let v be a 
child of u. We will apply induction by partitioning T into two subtrees: the subtree L consisting 
of v and all its descendants in T, and the subtree H = T — L. Consider a decomposition of (A, A) 
over L given by the vectors p k,L £ {0, and the positive reals p k,L for k = 1,2,..., n(L), and 

a decomposition of (A, A) over H given by the vectors p k ’ H £ {0, and the positive reals p k ’ H 

for fc = 1,2,..., n(H). 
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Denote by V the set of partitions of Q u fl Q v into two sets. Thus, by Definition [33j for each 
(cq/3) £ V we have a variable X[a,/.3], Note that O(T) = D(Lf) U D(L). We construct a family of 


38 


for T, as follows. 


vectors and reals satisfying (a)-(d) Theorem 

For each (a, (3) £ V such that X[a,/3] > 0, and each pair i, h such that 1 < * < n(L), 
1 < h < n(H ), and p h ’ H [a,/3\ = p x ’ L [a,(3\ = 1 we create a vector and a real 7 using 
the rule: 

For any vertex t in T and (Y, N) £ fl t : 

(r.l) If t £ V(L) we set q*jf [Y, N] = p l,L [Y,N]. 

(r.2) If t £ V(H) we set qf^\Y,N] = p h ’ H [Y,N], 

Further, we set 

a ,p = iS' L n h ' H 

llh x[<*,0\ ' 

To argue that this construction is valid we note that since X[a , 0\ > 0, pairs of indices i, h as listed 
above must exist, by (d) of the inductive assumption applied to H and L. Furthermore, we have 

7 “/ > 0 . 

Now we will prove that the qih and the 7 , 7 , provide a decomposition of (X, A) over T. Let i and h 
be given. Since the restriction of p x,L (and p h,H ) to L (resp., H) satisfy (a) and (b) of the inductive 
assumption, so will q^- Thus, there remains to prove (c) and (d). 

First, consider (d). Let ( Y,N) £ D(T), say ( Y,N ) £ Q(H). We claim that 


£ 7S S 9,“d[r,A r ] = 


n (L) n(H) iL hH 

E EE E r M 1 p hL i^p}p hM &p]p h ’ H %m- (30a) 


a,/3,i,h 


(a,/3)eV:X[a,p]>0 1 


t=i ti X M\ 


This equation holds because in any nonzero term in either expression we must have p l ’ L [a,(3\ = 
p h,H [a,j.3} = 1 and since (' Y,N ) £ Q(H) we also have that qf^[Y,N] = p h ’ H [Y,N]. 

Now the right-hand side of (30a) equals 


z(L) 


E 

(a,P)eV:X[a,P]>0 

E 

(a,P)eV : X[ot,P]>0 \h=l 


i{H) 




h =1 


2(H) 


E H p h ’ H [a,p]p h ’ H [Y,N] , 


(31a) 

(31b) 


by the inductive assumption (d) applied to subtree L. The expression in (31b) equals 

E P KH [<yP\ P KH p hM [Y,N} ) . 

(a,p)ev : X[oc,p ]>0 



(32a) 


By inductive property (b) applied to subtree H , given 1 < h < n(H) we have that p h,H [a,/3] = 1 
for exactly one partition (a, (3) £ V, and so expression (32a) equals 


In summary, 


n(H) 

E » h ’ H P h ’ H [Y,N\- 

h=l 

n(H) 

E 7“AEm = E^’VEw 

Oi,(3,i,h h= 1 


( 33 ) 
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and by induction applied to the subtree H this quantity equals X\Y, N], Thus property (d) does 
indeed hold. 

Finally we turn to (c). Inductively, (c) and (d) hold for trees L and H. Thus, as noted in Remark 
f.4), X[0,0] = 1. But we have just shown that (d) holds for T, and in particular that it holds for 
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Y = N = 0. Using Remark 34 4.4) we obtain that (c) holds for T, as desired. 
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